Speaker-independent continuous speech dictation

نویسندگان

  • Jean-Luc Gauvain
  • Lori Lamel
  • Gilles Adda
  • Martine Adda-Decker
چکیده

In this paper we report progress made at LIMSI in speaker-independent large vocabulary speech dictation using newspaper speech corpora. The recognizer makes use of continuous density HMM with Gaussian mixture for acoustic modeling and n-gram statistics estimated on the newspaper texts for language modeling. Acoustic modeling uses cepstrum-based features, contextdependent phone models (intra and interword), phone duration models, and sex-dependent models. Two corpora of read speech have been used to carry out the experiments: the DARPA Wall Street Journal-based CSR corpus and the BREF corpus containing recordings of texts from the French newspaper Le Monde. For both corpora experiments were carried out with up to 20K word lexicons. Experimental results are also given for the DARPA RM task which has been widely used to evaluate and compare systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Continuous speech dictation in French

A major research activity at LIMSI is multilingual, speaker-independent, large vocabulary speech dictation. In this paper we report on efforts in large vocabulary, speaker-independent continuous speech recognition of French using the BREF corpus. Recognition experiments were carried out with vocabularies containing up to 20k words. The recognizer makes use of continuous density HMM with Gaussia...

متن کامل

Effective lexical tree search for large vocabulary continuous speech recognition

In this paper, we present an e cient calculation of the factored LM probabilities for speeding up the large vocabulary continuous speech recognition. We introduced a novel technique based on the independent calculation of the factored LM probability. The basic idea of the proposed method is that each factored LM probability is calculated on-demand for a new combination of a previous word hypoth...

متن کامل

Iterative unsupervised speaker adaptation for batch dictation

This paper describes an automatic batch-style dictation paradigm in which the entire dictated speech is fully utilized for speaker adaptation and is recognized using the speaker adaptation results. The key point is that the same speech data is used both for recognition as the target and for speaker adaptation. Two steps, speech recognition and speaker adaptation which uses recognition results a...

متن کامل

Improved estimation of supervision in unsupervised speaker adaptation

Unsupervised speaker adaptation plays an important role in \batch dictation," the aim of which is to automatically transcribe large amounts of recorded dictation using speech recognition. In the case of unsupervised speaker adaptation which uses recognition results of target speech as the means of supervision, erroneous recognition results degrade the quality of the adapted acoustic models. Thi...

متن کامل

Long term on-line speaker adaptation for large vocabulary dictation

On-line speaker adaptation is desirable for speech recognition dictation applications, because it o ers the possibility to improve the system with the speaker-speci c data obtained from the user. Since the user will work with such a device over a long period, for a dictation system the long term adaptation performance is more important than the adaptation speed. In contrast to speaker-dependent...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Speech Communication

دوره 15  شماره 

صفحات  -

تاریخ انتشار 1993